refactor(models): unify model configuration and standardize LLM client factory #283

Dallas98 · 2026-01-26T06:55:49Z

This pull request unifies model configuration management, updates service routing, and refactors how LLM (Large Language Model) clients are created and used, touching both the backend and the frontend. It standardizes the model configuration table and ORM usage in Java and Python, introduces an LLM factory for consistent model client instantiation, and updates the proxy and routing logic for cleaner service separation.

Key changes include:

Model Configuration Standardization

Renamed the model configuration table from t_model_config to t_models in both Java (ModelConfig.java) and Python (models.py), and updated the corresponding ORM class names and references throughout the codebase. Added an is_deleted field and switched boolean fields to use Boolean type instead of Integer for clarity and consistency. [1] [2] [3] [4] [5]
Updated all usages of the old model config logic in evaluation and RAG modules to use the new Models class and get_model_by_id helper, ensuring consistent model retrieval and property access. [1] [2] [3] [4] [5] [6] [7]
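To make the table change concrete, here is a minimal sketch of a t_models lookup that skips soft-deleted rows. It uses the standard-library sqlite3 module as a stand-in for the project's ORM. The column names other than is_deleted, the dataclass fields, and the assumption that get_model_by_id hides soft-deleted rows are all illustrative and not taken from the PR.

```python
import sqlite3
from dataclasses import dataclass
from typing import Optional


@dataclass
class Models:
    """Hypothetical row type for the renamed t_models table."""
    id: str
    model_name: str   # assumed column name
    is_enabled: bool  # assumed column; a Boolean rather than a 0/1 Integer
    is_deleted: bool  # soft-delete flag added in this PR


def create_schema(conn: sqlite3.Connection) -> None:
    # SQLite stores BOOLEAN values as 0/1; the declared type documents intent.
    conn.execute(
        """
        CREATE TABLE t_models (
            id TEXT PRIMARY KEY,
            model_name TEXT NOT NULL,
            is_enabled BOOLEAN NOT NULL DEFAULT 1,
            is_deleted BOOLEAN NOT NULL DEFAULT 0
        )
        """
    )


def get_model_by_id(conn: sqlite3.Connection, model_id: str) -> Optional[Models]:
    """Return the row, or None if it is missing or soft-deleted (assumed behavior)."""
    row = conn.execute(
        "SELECT id, model_name, is_enabled, is_deleted FROM t_models "
        "WHERE id = ? AND is_deleted = 0",
        (model_id,),
    ).fetchone()
    if row is None:
        return None
    return Models(row[0], row[1], bool(row[2]), bool(row[3]))


conn = sqlite3.connect(":memory:")
create_schema(conn)
conn.executemany(
    "INSERT INTO t_models (id, model_name, is_enabled, is_deleted) VALUES (?, ?, ?, ?)",
    [("m1", "chat-model", True, False), ("m2", "old-model", True, True)],
)
```

With a single helper like this, callers in the evaluation and RAG modules do not each need to remember the is_deleted filter.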

LLM Client Factory Introduction

Introduced a new LLMFactory module (app/module/shared/llm/__init__.py) to centralize and standardize the creation of chat and embedding model clients, as well as related utilities. Updated generation and RAG services to use this factory for all LLM interactions, replacing previous ad-hoc client creation logic. [1] [2] [3] [4] [5] [6] [7]
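The factory idea can be sketched roughly as follows. This is not the code in app/module/shared/llm/__init__.py: the ModelConfig fields, the "chat" and "embedding" type strings, the create method, and the two fake client classes are hypothetical placeholders for whatever real client library the project wraps.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class ModelConfig:
    """Hypothetical subset of a model configuration row."""
    model_name: str
    base_url: str
    api_key: str
    model_type: str  # "chat" or "embedding" (assumed values)


class FakeChatClient:
    """Stand-in for a real chat client; it only echoes the prompt."""

    def __init__(self, cfg: ModelConfig) -> None:
        self.cfg = cfg

    def invoke(self, prompt: str) -> str:
        return f"[{self.cfg.model_name}] {prompt}"


class FakeEmbeddingClient:
    """Stand-in for a real embedding client; returns a one-element vector."""

    def __init__(self, cfg: ModelConfig) -> None:
        self.cfg = cfg

    def embed(self, text: str) -> List[float]:
        return [float(len(text))]


class LLMFactory:
    """Single place that maps a model configuration to a client instance."""

    _builders: Dict[str, Callable[[ModelConfig], object]] = {
        "chat": FakeChatClient,
        "embedding": FakeEmbeddingClient,
    }

    @classmethod
    def create(cls, cfg: ModelConfig) -> object:
        builder = cls._builders.get(cfg.model_type)
        if builder is None:
            raise ValueError(f"unsupported model type: {cfg.model_type!r}")
        return builder(cfg)
```

Services then ask the factory for a client instead of constructing one inline, so a change to client construction happens in one module.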

API Gateway and Frontend Proxy Routing

Updated the API gateway routing to unify Python backend service routes under a single route and expanded the path matching to include /api/models/**.
Refactored the frontend Vite proxy configuration to dynamically route requests to either the Python or Java backend based on the API path, improving maintainability and service separation.
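The path-based routing decision can be illustrated with a small Python function. The real logic lives in the gateway configuration and the Vite proxy config, so this is only a model of the idea. The backend URLs and every prefix except /api/models are hypothetical.

```python
from typing import Sequence

# Hypothetical targets; the real hosts and ports are not given in this PR.
PYTHON_BACKEND = "http://localhost:18000"
JAVA_BACKEND = "http://localhost:8080"

# Only /api/models is named in the PR; the other prefixes are assumptions.
PYTHON_PREFIXES: Sequence[str] = ("/api/models", "/api/rag", "/api/evaluation")


def resolve_backend(path: str) -> str:
    """Pick the backend for a request path by matching whole path segments."""
    for prefix in PYTHON_PREFIXES:
        # Match the prefix itself or anything below it, but not look-alikes
        # such as /api/modelsx.
        if path == prefix or path.startswith(prefix + "/"):
            return PYTHON_BACKEND
    return JAVA_BACKEND
```

Keeping the prefix list in one place means a newly added Python route needs one new entry rather than another hand-written proxy rule.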

RAG Service Dependency Injection

Improved the FastAPI RAG interface to use dependency injection for the RAGService, simplifying endpoint logic and making it easier to manage dependencies. [1] [2]
Cleaned up the RAG service constructor and removed unused background task logic.

These changes collectively improve maintainability, consistency, and scalability of model management and service orchestration across the system.

Dallas98 added 11 commits January 21, 2026 12:05

feat: add model configuration routes and update schema exports

e015513

feat: implement LLMFactory for unified model creation and health checks; add is_deleted field to model config

e107489

Merge branch 'refs/heads/main' into dev

c1ba1ee

Merge branch 'refs/heads/main' into dev

5b73115

Merge branch 'refs/heads/main' into dev

152f5cd

feat: implement LLMFactory for unified model creation and health checks; add is_deleted field to model config

477af37

Merge branch 'main' into dev

7710856

feat: update routing for Python service and add is_deleted field to ModelConfig

63bbc47

Merge branch 'main' into dev

d8f92f3

feat: update API proxy configuration and enhance model size limit

9f6e094

Merge branch 'main' into dev

1478ac4

Dallas98 changed the title from "Dev" to "refactor(models): unify model configuration and standardize LLM client factory" on Jan 30, 2026

Dallas98 merged commit f40893c into main Jan 30, 2026
22 checks passed

Dallas98 deleted the dev branch January 30, 2026 10:29
